Application of loudness/pitch/timbre decomposition operators to auditory scene analysis
نویسندگان
چکیده
We proposed[1] nonlinear operators which decompose a changing energy of sound in wavelet domain into three orthogonal components: i.e., loudness and pitch as coherent changes, and timbre as incoherent change. We showed that they could detect the discontinuity of a single sound stream with excellent temporal resolution and sensitivity. In this paper, we extend the coherency principle so that it can describe and pursue the individual coherency of non-overlapping sound streams in wavelet domain. It is realized by Parzen’s non-parametric estimates and Kalman filtering of loudness change rate and pitch shift rate. Using this method, we show some experiments for extraction of the most salient stream from multiple sound streams.
منابع مشابه
Models of Timbre Using Spectro-temporal Receptive Fields: Investigation of Coding Strategies
Timbre designates all of the perceptual characteristics of sounds that cannot be described as pitch, loudness or duration. Behavioral experiments combined with multidimensional scaling techniques have proposed that a few main acoustic dimensions subserve the perception timbre for homogeneous ensembles of sounds (e.g., Western musical instrument sounds). It is unclear however whether these dimen...
متن کاملHuman Echolocation in Static Situations: Auditory Models of Detection Thresholds for Distance, Pitch, Loudness and Timbre
We investigated, by using auditory models, how three perceptual parameters, loudness, pitch and sharpness, determine human echolocation. We used acoustic recordings from two previous studies, both from stationary situations, and their resulting perceptual data as input to our analysis. An initial analysis was on the room acoustics of the recordings. The parameters of interest were sound pressur...
متن کاملInfluence of pitch, loudness, and timbre on the perception of instrument dynamics.
The effect of variations in pitch, loudness, and timbre on the perception of the dynamics of isolated instrumental tones is investigated. A full factorial design was used in a listening experiment. The subjects were asked to indicate the perceived dynamics of each stimulus on a scale from pianissimo to fortissimo. Statistical analysis showed that for the instruments included (i.e., clarinet, fl...
متن کاملNonlinear time-frequency domain operators for decomposing sounds into loudness, pitch and timbre
In this paper, we propose a method for decomposing instantaneous changes of sounds into three energy components, i.e., loudness, pitch, and timbre. These operators are derived from an eigenstructure analysis of the time-frequency gradient space (a 3-D space spanned by a modulus and partial derivatives of a wavelet transform). By several experiments, we found that they have superior resolution a...
متن کاملPerceived Match between Visual Parameters and Auditory Correlates: an Experimental Multimedia Investigation
This paper investigates the relationship between the auditory and visual components of an audio-visual (A-V) composite. Participants (N=28) were asked to rate the perceived degree of “match” between A-V components in a series of randomly presented composites. Manipulated audio parameters included pitch, loudness, timbre, and duration, while visual parameters included color, vertical location, s...
متن کامل